AN EXPLICIT INCIDENCE THEOREM IN Fp 

HARALD ANDRES HELFGOTT AND MISHA RUDNEV 

Abstract. Let P = ^x^cFpXFp,pa prime. Assume that P = Ax A has n 

elements, n < p. See P as a set of points in the plane over Fp. We show that the 

pairs of points in P determine > cn^^ 257 hnes, where c is an absolute constant. 

We derive from this an incidence theorem: the number of incidences between 

a set of n points and a set of n lines in the projective plane over Fp (n < yfp) is 
3 _ 1 

bounded by Cna loerg , where C is an absolute constant. 



1. Introduction 

In 1983 J. Beck proved the following incidence theorem in M^. Let P be a 
set of points in M?. Then either P contains c\P\ points on a straight line, or the 
pairs of points of P determine at least c|Pp distinct lines, where c is an absolute 
constant. (We write IS"! for the number of elements of a finite set S; c and C will 
henceforth denote some absolute constants, changing from one place to another.) 
Beck's original paper ([!]) was followed - in the same issue of the same journal - 
by a result of Szemeredi and Trotter stating that the number of incidences between 

m straight lines and n points in is O + n + (mn)3^ . The Szemeredi- Trotter 
theorem implies Beck's theorem as a corollary. 
We prove a finite field version of Beck's theorem. 

Theorem 1. Let A C Fp, where p is larger than an absolute constant. Let P = 
Ax A C Fp. Let L = L{P) he the set of all lines determined by pairs of points of 
elements of P. If \A\ < yjp, then 

where c > is an absolute constant. 

All constants here and later are independent of p. 

The statement > clPl^^'^zsr cannot possibly hold in Fp in full generality, 

as P = Fp itself generates only 0(|P|) lines. A non-trivial theorem of Beck type is 
implicit in the well-known paper of Bourgain, Katz, and Tao ([3]) - namely, their 
results imply that, if P C Fp is a Cartesian product and |P| < p'^^'^ (A > 0), then 
|L(P)| = O (^\P\^~^^^^^^ for some 5{A) > 0. However, there was no attempt to make 
explicit or optimise S{A). 
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The problem of studying L{P) is an incidence problem, and [1] proves that for 
n < p'^~^, the number of incidences between n lines and n points is O ^|n| z"*^!*^^)^ . 
This statement happens to be on about the same level of generality as claiming 
that a set of n points that is a Cartesian product A x A (with n = < 
determines 17 (n^"'"''^*-^'') distinct lines. These results in [4J stem from a non-trivial 
sum-product estimate in ¥p proven in [1]. The sum-product estimate of [4J (the 
following formulation also includes Konyagin's contribution [7J) says that, if A + A 
and A ■ A denote, respectively, the set of all sums and products of pairs of elements 
of A C ¥p, then, as long as |A| < one has 

(1.1) max(|^ + ^|, l^-^l) >c\A\^+^^^^\ 

for some absolute c. The quantitative relation between the deltas was not estab- 
lished. 

(At this point one traditionally mentions the Erdos-Szemeredi conjecture, which 
states that, if ^ is a subset of integers, then max(|A + A\, \A ■ A\) > c\A\^~^^ holds 
for any < 6 < 1, where c is allowed to depend only on 6, but not on \A\.) 

We return to ¥p for the rest of the paper. On the level of the existence of positive 
exponents, a nontrivial sum-product estimate implies a non-trivial incidence or Beck- 
type theorem, and conversely. Garaev [6] succeeded in obtaining a quantitative sum- 
product estimate in F^: for a small enough (say \A\ < ^) subset A of Fp, either 

A+ A ov A- A has cardinality at least l^l^"*"!* , up to a multiple of a power of clog \A\. 
Katz and Shen ([8]) elaborated on a particular application of the Pliinnecke-Ruzsa 
inequality in Garaev's proof and improved the result to lAI^"*"!^, up to a multiple of 
a power of clog|A|. Bourgain and Garaev ([3]) incorporated a covering argument 
(whose variant we cite as Lemma [3]) and improved the estimate to |^|^^T2, up to a 
multiple of a power of clog |^|. Li ([9j) showed that a multiple of a power of log 1^41 
can be done away with; thus the best result known states that, for |^| < .Jp, 

max(|^ + ^|, l^-^l) > c|A|^+ii, 

where c is an absolute constant. 

Our construction uses the techniques laid out in the above-mentioned papers and 
locally follows rather closely the exposition from some of those papers. The main 
result. Theorem 1, an explicit Beck type incidence theorem, implies the following 
incidence theorem for n points and n straight lines in the projective plane P^(Fp). 

Theorem 2. If P and L are sets of points and lines in P^(Fp) with \P\, \L\ = n < p, 
then the number of incidences 

I{P,L) = \{{p,l) e Px L: pel}\< Cn^-Tom 

for some absolute C . 

We call {(p, /) G P X L : p G /} the set of incidences. 

The proof of this theorem repeats the pigeonholing argument of ^ until it merges 
with the proof of our Theorem [H It is given at the end of this note. 
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Remark 3. In both Theorems [T] and [2] one can easily extend the estimate to larger 
sets, by a straightforward adaptation of Case (i) of the estimate (|3.19p in the end 
of our proof. We haven't done so aiming at an estimate which does not contain p 
explicitly; we leave the case of "larger" sets to the interested reader. 

1.1. Acknowledgments. H. A. Helfgott is supported in part by EPSRC grant EP- 
E054919/1. The authors would like to thank M. Garaev, T. Jones and O. Roche- 
Newton for their helpful comments on the draft of this paper. 



2. Background in arithmetic combinatorics 

We use the following largely standard arithmetic combinatorics lemmata. In 
the sequel, in order to suppress constants, we will use the ~ notations in 

estimates: |A| ^ \Y\ means |A| > c\Y\ for some c, |A| <C means |A| < c\Y\ 
for some c, |A| ~ \Y\ means |A| < Ci|y| and \X\ > C2\Y\ hold for some Ci, C2- 
We abuse the English language in accordance with these notations by saying "at 
least", "at most", or approximately in the sense conveyed by the symbols 
, ~, respectively. To avoid confusion, we will enclose "at least" and "at most" in 
quotation marks when we use them in this way. 

Thus, for example, saying that \X\ is "at least" |y| means \X\ ^ \Y\. 

We adopt the following formulation of the Balog-Szemeredi-Gowers theorem. 

Lemma 1 (Balog-Szemeredi-Gowers theorem). Let X,Y be additive sets of n el- 
ements, and a G (0,1). Suppose that there is a set of ar? pairs of elements 
(x, y) G X X Y on which the sum x + y takes at most n distinct values. 
Then there exist subsets X' ^ A, y' C Y , with \X'\, \Y'\ ^ an, such that 



\Range of x + y on X' x Y'\ ^ a 



-5 



n. 



The modern graph-theoretical proof of the Balog-Szemeredi-Gowers theorem can 
be found in [13, Thm. 2.29] . The proof as appears in that standard reference appears 
to have a typographical error, however, which leads to an exponent of —4, rather 
than to the correct (and somewhat weaker) exponent of —5 that we have above. For 
a proof yielding the exponent —5, see [5]. 

We adopt the following form for the Pliinnecke-Ruzsa inequality, due to Ruzsa 

m)- 



Lemma 2. Let Y; Ai, . . . A^ be additive sets. Then there exists a non-empty subset 
Y' C Y, such that 

(2.1) |y' + Ai + ... + A,| < nkll±^|y'|. 
Ruzsa's inequality immediately implies that 

(2.2) |Xi + ... + A,|< n-il^ + ^^l 
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for any "dummy set" Y. To alow for — signs as well + signs, this inequality is often 
used in conjunction with the Ruzsa distance inequality 



(2.3) 



1^3 



(See, e.g., [131 Lemma 2.6] for a brief proof of ()2.3p .) 

Remark 4. Katz and Shen ([8]) showed that, at the expense of acquiring a constant 
in the left-hand side of (j2.ip . one can make Y' contain an arbitrarily large propor- 
tion of Y. This trick was used to gain an improvement in ([8]) and subsequent 
above-mentioned papers on the sum-product problem. We have not found a way to 
take advantage of it here, as we shall be dealing with set families, controlling their 
intersections, refinements therefore being apparently forbidden. I.e., we essentially 
use Pliinnecke- Ruzsa only in the "crude" form (j2.2p . 



Finally, we need the following covering lemma (see e.g. [3], [TT], [9]; the proof is 
a Cauchy-Schwartz type averaging argument). 

Lemma 3. Let Xi and X2 he additive sets. Then for any e G (0, 1) and some 



constant C{e), there exist 



C(e) 

\X2\ 



min(|Xi -|- X2I, |Xi — X2I) translates of X2 whose 



union contains not less than (1 — e)|Xi| elements of Xi. 

3. Proof of Theorem [T] 

A1XA2 C FpXFp, l^il 



n. 



We will prove the result more generally for P 

Let L{P) be the set of straight lines generated by pairs of elements of P . Suppose 
that 

(3.1) |L(P)| 

where b < We will show how to reach a contradiction. 

The approximately pairs of distinct points of P are distributed between ap- 
proximately n^"'"^'^ lines. This implies that a positive proportion of approximately 

pairs of those points are supported on rich lines, meaning lines with "at least" 
n^~^ points on each. These rich lines thus contain "at least" n^^^ collinear triples 
of distinct points of P. 

Hence, the equation 

111^ 

0, xi,X2,X3 G Ai, 2/1,7/2, 2/3 G A2 



(3.2) 



Xi X2 X3 

yi y2 ys 



has "at least" ^ solutions. Then, for some fixed and non-equal 2/1,2/2 £ A2, the 
equation 

'111^ 

0, xi,X2,X3 e Ai, 2/3 G A2 



(3.3) 



Xi X2 

yi 2/2 



X3 
2/3 



AN EXPLICIT INCIDENCE THEOREM IN ¥p 



5 



has "at least" n^^^ solutions. Since we can translate and dilate A2, we can assume 
yi = 0, y2 = 1 without loss of generality. Then the condition (j3.3p turns into a claim 
that 

(3.4) xi{l - y-s) + X2y3 ^ Ai xi,X2 £ Ai, £ A2 

happens for "at least" n^~^ triples {xi,X2,y3). Assuming 2/3 7^ 0, 1 does not change 
the situation, as long as 5 < 1. Let us define B as the set of elements of the form 
b = T^j! 2/3 £ ^2 \ {0, 1}. Clearly \B\ \A2\ = n and B does not contain zero. 

(The set-up here is similar to that of Theorem C in Bourgain ([2]), which would 
at this point yield the existence of 5, with a possibility of its quantitative estimate. 
However, if one chases through Bourgain's arguments, the value of 5 appears to be 
considerably smaller than what we obtain.) 

Let Bi be the set of popular elements of i?, meaning the set of all 6 € i3 such 
the equation ()3.4p has "at least" n^~^ solutions (xi,X2,y3) £ Ai x Ai x A2 with 
= h. By the pigeonhole principle, 

\Bi\->n^-^. 



Besides, fixing h fixes 2/3, so for each 6 G i?i the condition 

xi + hx2 G ^—fttM 

1 - Vz{b) 

holds for "at least" 'n?~^ pairs (xi,X2) G ^1 x Ai. 

Applying the Balog-Szemeredi-Gowers theorem for each 6, yields the existence of 
the subsets Al and Al of Ai, with \ A\\ > lAil^""^ = n^^\ i = 1, 2, such that 

(3.5) \Al + bAl\^n^+^^. 
Let us restate that 

(3.6) \Al\,\All\Bi\:^n^-\ 

Let Al, = A\ X j4^. In view of (j3.6p and by Cauchy- Schwartz, since each A^, is a 
subset oi Ax A, 



SO for some 6* G Bi (which can be assumed non-zero), denoting A^ = A[,^, we have 

(3.7) ^ |An^,| > l^iln^-^-^. 

Let B2 be a popular subset of namely a set such that for all b B2 we have 

(3.8) AbA* = |An^*| > n^-^^; 
clearly, 

(3.9) \B2\ > n^-^\ 
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Besides, each ^(,a* is a cartesian product: 

AbA* = ^6A* ^ ^6A*' 

and therefore, for i = 1,2 

(3.10) AU^^AlnAi » n'-^'. 



We now apply the Pliinnecke-Ruzsa inequahty (j2.2p with A; = 2, as well as (|3.5p . 
and the set cardinality estimates (|3.6p and (j3.10p to draw the following conclusions. 
For each b £ Bi we have 

Furthermore, for each b £ B2: 

\b,Al + bAl\< l^*-4^ + ^*-4bAJI^*-4gA. + ^-4g| ^m|j^^2^^ + ^^2| 

I 4 1 I ' 
l^6A*l 

by (ESI), (ICTl . (imH . and (l33]) . 
Then 



and 



(3.12) + < ^^^^ ''^* 

Throughout the rest of the proof, to save on the number of indices used, let us 
refer to Al as X and to b~^B2 as Y. 
Let us use the symbol 

(3.13) K = max\X + yX\, so K <^ n^+^^^ . 

yeY 

Consider the equation 

(3.14) X2 + yxi = X2 + yxi, xi,X2,xi,X2 £ X, y e Y. 

This equation has at least - — j^-^ solutions, as follows by applying Cauchy-Schwartz 
for each individual Y and then summing over y £ Y. Equation (j3.14p is equivalent 
to 

a;2 - ^2 = y{xi - xi), xi,X2,xi,X2 £ X, y gY. 
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Hence, for some fixed {Xi,X2) £ X x X, the above equation has at least - — j^-'- 
solutions. Let Xi = X — xi, X2 = X — X2 he translates of X by xi and X2, 
respectively. The equation 



(3.15) v = yu, u £ Xi, V £ X2, y £Y 

has at least - — j^-^ solutions. Consider the set Xi x X2 in F^. The bound we have 
just given as to the number of solutions of equation (j3.15p can be rephrased as 
saying that the set of straight lines through the origin with slopes in Y makes at 
least - — -j^-^ incidences with Xi x X2. 

In the remainder of the proof will assume that ^ 1, i.e., that the lines in 
question contain "at least" one point each on average. This follows immediately 
from (|3.6p and (j3.13p once we assume 6 < 

Not less than 50% of the incidences specified by ()3.15p are contributed by rich 

lines with "at least" '-j^ points thereon. The number of rich lines is not greater 
than I y I, and "at least" 

Those "at least" - — -j^-^ points of Xi x X2 lying on rich lines can have \X\ different 
abscissae. Hence, there is a vertical set x X2 for some non-zero G Xi intersected 
by "at least" ' ' rich lines. 

Thus, we have a subset Yi C {X2 n u^Y) of cardinality 

(3.16) 

In the original notations, Yi lies in the intersection of u^b^^B2 and some translate 
of A1; besides, 

(3.17) |yi|>ni-65^ 

by the bounds ([Ml), ([SJD, (|XT2]) . 

Let R be the set of all elements expressed via r = where p, q, s, t are elements 
of and s ^ t. Let us consider two cases: (i) \R\ > jYip and (ii) < Since 
n < ^/p, = Fp is a possibility only in case (i). 

Let us consider Case (ii) first. For any ^ ^ R and any non-equal yi,y2 £ ^i, the 
sum yi + ^y2 has a single realisation, as 

(3-18) yi + ^y2 = y'l + ^y2, yi,y2,y'i,y2 ^Yi 

would imply ^ = Thus, for every subset Y( C Y and any nonzero ^ R, 

(3.19) |y/ + ^y/| = |y/|2. 

For any y ^ we have some p,q,s,t £ Yi, such that ^ = 2^ -)- y lies in the 
complement of R, for otherwise R + {y} = R, which is possible only if R = ¥p. In 
particular, this holds for y = 1. Recall that Yi is a subset of ti*6^^i?2, and so we 
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may regard p, q, s, t as elements of i?2- We have, then, some fixed p, q,s,t E B2 such 
that 

(3.20) \Yi\^ < \Yi + ^y;\ < \y; + y; + ^^r/i, 

s — t 

for any Y( C Yi, that constitutes a positive proportion of li, to be chosen next. 

We now use Lemma [3l Let us first show that for any b €z B2 we can cover 
99% of the elements of the sets bYi (a subset of a translation of bAl) or —bYi (a 
subset of a translation of —bAl) by "at most" n^^*^ translates of the set Al. Indeed, 
Al^^ = Al n Al is a subset of Al, and by Lemma [3] and (|2.2p . we can cover 99% of 
the elements of either bYi or —bYi by "at most" 

.oo^^ + bYi\ ^ \Al^, + bAl\ ^ \Al^^ + bAUlbAl^^ + bAl\ 

^ iT^n iT^n 1142 I ^ 

translates of Al^^, and hence of Al. In the last estimate we've used (jS.Sp . (jS.lOp . 
and (I3TT]1 . 

This altogether enables us to choose Y( as a subset containing at least 50% of 
Yi, and such that {p — q)Y( gets covered by "at most" n^^^ translates of Al + Al. 
Let us now, in the same vein, be a a subset containing at least 50% of A^, such 
that (s — t)Al gets covered by "at most" n^^^ translates of j4J + ^4;^. Then we apply 
Pliinnecke-Ruzsa to (|3.20p as follows: 



\y; + y; + ^^y;\ « 



Al + Yl + Yl\\Al + ^Yl\ 



(3.22) \Al + Al + Al\ ^~^ , P-Q^;i 

«n'''\Al + P^Y(\, 
s — t 

after applying Pliinnecke-Ruzsa with fc = 3 and the "dummy set" b~^Al, using p.6p . 
The covering argument above implies that 

\Al + P^Yll « n^^V: + a: + + Ail « n^+n^^ 
s — t 

by applying Plhnecke-Ruzsa with the "dummy set" b^^Al and k = A, using (|3.5p and 

die]). 

Therefore returning to (j3.20p . we have 

(3.23) < ni+i375_ 

Comparing this with (j3.17p . and recalling that Y( contains at least half of the el- 
ements of Yi, we conclude that 267(5 > 1, so (5 > This ends Case (ii) and 
essentially ends the proof of Theorem [TJ 

Indeed, to analyse Case (i) we observe that if \R\ > jllp, then, summing the 
number of ordered quadruples (yi , 2/2 1 y'l , 2/2 ) elements of Yi satisfying equation 
(j3.18p over all ^ G we get no more than 2|i?| |y/p for the total number of solutions, 
pentuples {yi, 1/2, y'l, 1/21 0- Indeed, given ^, the solutions can be either trivial, with 
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(yi)2/2) = {yijy'2) ^ot- The total number of non-trivial solutions is since 
such a solution determines the total number of trivial ones is Under 
the assumption of Case (i) the number of trivial solutions dominates the number 
of non-trivial ones. Hence, by the pigeonhole principle and Cauchy-Schwartz, there 
exists ^ = 1^ e R such that 

|y/|4 

for every subset C Yi with \Y-[\ ^ |yi|. 

This amounts to a simplification of (|3.20p , with Y' replacing Y' + Y' in its right- 
hand side. The ensuing estimates, done in exactly the same way as in Case (ii) are 
therefore better - by a factor of n^^^ , as follows by inspection of (j3.2ip and (j3.22p . 

This ends the proof of Theorem [TJ □ 

4. Proof of Theorem [2] 
We follow [1], Section 6. Suppose, for contradiction, that for some {P,L) one has 
(4.1) /(P,L)«nt-% 

for some e > 0. We shall give the lower bound for such e. Note that the role played 
by the parameter n in this theorem is different from the proof of Theorem [H 

Let us first off erase the points in P incident to more than Cna"'"'^ lines of L, 
without changing the notations {P,L). This can be done, as the maximum number 
of incidences that can come from the set P+ of such points is 




since any two distinct lines of L meet at at most a single point of P+ . (Here we use 
the notation b^i = 1 if the point p is incident to the line (5pi = otherwise.) 

This having been done, let Pi be the set of popular points of P, each incident to 
"at least" (recall that saying "at least" implies a suitable constant c) n"'^ lines of 
L. We have I{Pi,L) ~ I{P,L) by the following argument: 

I{P,L)-I{Pi,L) = ^ ^<5p,, < ^cni-^ < cni-^ < ^/(P,^) 

PiPi 

for c small enough. 

We can now refine L to the subset Li of popular lines, each incident to "at 
least" na"*^ points of Pi. By the pigeonhole principle, it still contributes a positive 
proportion of incidences. Let P2 C Pi with respect to Li. (That is, p G Pi is an 
element of P2 if and only if it is incident to "at least" n^"*^ lines of Li.) 
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Once again, I{P2,Li) satisfies (14. ID . (The refinement process could be iterated 
any finite number of times, with the constants obviously getting worse.) 

For p G P2, let Pp be a subset of all points of Pi, which are connected to p by 
some line from Li. By definitions of P2-,Li we have \Pp\ ^> n^"^*^, for each p S P2. 
Thus, by Cauchy-Schwartz 

\P2W~^' 

and so 




P,peP2 p,psP2 P6P2 

P¥=p 



3 1 

Now since I{P2,Li) w nz""^ and each point of P2 is incident to at most ^ na" 
lines of L, we have IP2I ^ n^~'^'', and thus 



E 



Therefore one can fix some G (P2 x ^2), P P-, such that 

^2-4e 

P3 = \Pp n Ppl > -— > nl-'^ 
l^il 

Each point of P3 C P^ is incident to "at least" na"*^ lines of the original set 
of lines L. Without loss of generality, after a projective transformation we can 
place the points p and p on the line at infinity, so that the "at most" nz"'"'^ lines 
of L\ emanating from these points can be viewed as being parallel to the x and y 
coordinate axes. In other words, for some A, B of cardinality "at most" na"'"'^ each, 
we have a subset P3 of ^ x P of cardinality "at least" such that the number 

of incidences of P3 with L is 



-be 



(4.2) I{P3,L):^n 

Now, the number of triples of points of A x P, which are collinear on some line 
from L is then (by Holder's inequality or simply noticing that the smallest number 
of triples is achieved with ()4.2p as equality, with n lines, each supporting the same 

number of points) "at least" n^" ^ \A\ i+2s . We can now merge with the proof 
of Theorem [T] at its claim (13. 2p , with e = ^^^25 — iMis ■ 
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